首页> 外文OA文献 >The Power of Both Choices: Practical Load Balancing for Distributed Stream Processing Engines
【2h】

The Power of Both Choices: Practical Load Balancing for Distributed Stream Processing Engines

机译:两种选择的力量:分布式实际负载均衡   流处理引擎

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We study the problem of load balancing in distributed stream processingengines, which is exacerbated in the presence of skew. We introduce Partial KeyGrouping (PKG), a new stream partitioning scheme that adapts the classical"power of two choices" to a distributed streaming setting by leveraging twonovel techniques: key splitting and local load estimation. In so doing, itachieves better load balancing than key grouping while being more scalable thanshuffle grouping. We test PKG on several large datasets, both real-world andsynthetic. Compared to standard hashing, PKG reduces the load imbalance by upto several orders of magnitude, and often achieves nearly-perfect load balance.This result translates into an improvement of up to 60% in throughput and up to45% in latency when deployed on a real Storm cluster.
机译:我们研究了分布式流处理引擎中的负载平衡问题,该问题在存在偏斜的情况下会加剧。我们介绍了部分密钥分组(Partial KeyGrouping,PKG),这是一种新的流分区方案,它通过利用两种新颖的技术(密钥分割和本地负载估计)将经典的“两种选择的能力”适应于分布式流设置。这样,它与密钥分组相比可实现更好的负载平衡,而与随机分组相比可扩展性更高。我们在真实和合成的几个大型数据集上测试了PKG。与标准哈希相比,PKG可以将负载不平衡减少多达几个数量级,并且通常可以实现近乎完美的负载平衡,因此在实际环境中部署时可以将吞吐量提高60%,将延迟提高45%风暴集群。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号